
Revamp the end-of-test summary #4089

Open
wants to merge 49 commits into base: master
Conversation

joanlopez
Contributor

@joanlopez joanlopez commented Dec 4, 2024

Overview

This pull request replaces the current end-of-test summary with a new design, with two available formats:

  • a) compact (default)
  • b) full

The goal is to bring clearer and more valuable results to users. Find a screenshot below.

User-facing details

  • 💹 The appearance of the end-of-test summary is now different, with thresholds at the beginning (instead of being inlined with metrics), checks slightly modified, and metrics grouped by category (http, ws, network, etc.).
  • 🔠 The user can choose between:
    • the compact summary with --with-summary=compact, or with no argument, as it is the default choice.
    • the full summary with --with-summary=full.
    • the legacy summary with --with-summary=legacy.
  • ⚠️ The data model passed into the custom handleSummary function is now different. So, users relying on it must migrate their implementation or use --with-summary=legacy in the meantime.
    • 🙏🏻 (Dear reviewer) If you think we should definitely keep the old format, I can give it a try and see if I can adapt the new data to it before calling the function, without many extra allocations (I guess that since we don't propagate sinks here it should be fine in general). But please comment explicitly, stating your reasons and your concrete proposal.
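For users affected by this change, one defensive option is a handleSummary implementation that doesn't depend on specific fields of either data model; a minimal sketch (the pass-through approach is an illustration, not a migration guide):

```javascript
// Hypothetical sketch: a custom summary handler that serializes whatever
// data k6 passes in, so it keeps producing output under both the legacy
// and the new data model. In a real k6 script this would be declared as
// `export function handleSummary(data)`.
function handleSummary(data) {
  return {
    // The exact shape of `data` under the new model is an assumption here,
    // not the final API; we avoid touching its inner fields entirely.
    'summary.json': JSON.stringify(data, null, 2),
  };
}
```

Scripts that do read inner fields (e.g. specific metrics) are the ones that need migration, or --with-summary=legacy in the meantime.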

Technical details (for review purposes)

  • The core logic of the new end-of-test summary, how it collects metrics, etc., is based on a new output.Output named summary.
  • There's a new example test script under internal/cmd/testdata/summary/... with different scenarios, groups, thresholds, custom metrics, and more, that can be used for both automated and manual testing. If you think anything is missing, just suggest it.
  • The JS code responsible for rendering the summary has been largely refactored and type-documented (👏🏻 big shout-out to @oleiade), aiming to make that code far more maintainable than it has been until now. I guess that, once we merge this PR, we may need to copy-paste it back to https://github.com/grafana/k6-jslib-summary.
  • I left two data structures for the summary representation (lib.Summary vs lib.LegacySummary) to keep support for the legacy summary for some time in an easy way, so the two aren't mixed and the future cleanup is simpler: just remove that type and all the references to it, and simplify the few conditionals that behave differently depending on which summary type is provided.
    • Similarly, I left the old JS code for the summary as summary-legacy.js, for simpler cleanup whenever we remove that support, which I guess might be in v2 (once we ship the formalized JSON output format within v1).
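As a rough illustration of the "metrics grouped by category" behavior mentioned above (the prefix list and grouping rule are assumptions for the sketch, not the actual implementation in the summary JS code):

```javascript
// Hypothetical sketch: bucket metric names by their protocol prefix so a
// renderer could print them under http/ws/grpc/... headings. The prefix
// list is an assumption for illustration only.
const KNOWN_PREFIXES = ['http', 'ws', 'grpc', 'browser'];

function groupByCategory(metricNames) {
  const groups = {};
  for (const name of metricNames) {
    // Match e.g. "http_req_duration" against the "http" prefix.
    const prefix = KNOWN_PREFIXES.find((p) => name.startsWith(p + '_'));
    const category = prefix || 'other';
    (groups[category] = groups[category] || []).push(name);
  }
  return groups;
}
```

For example, groupByCategory(['http_req_duration', 'vus']) would place http_req_duration under http and vus under other.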

Internal Checklist

Before review readiness

  • Add support for a compact summary mode (likely enabled by default, so perhaps an "extended" flag, to avoid memory allocation issues) that only "relays" metrics but does not store metrics for groups and scenarios.
  • Revisit sorting: is there anything else we can sort for this first iteration? (Note that some ideas have been moved to a second iteration; see below.)
  • Decide what we want to do with the existing summary: whether we want to use the new lib.Report as the new API for custom handleSummary (note this would be a breaking change), or whether we want to ship this progressively.
    • Review and modify (if needed) the code in js/summary.go according to that decision.
  • Review and refactor the code in js/summary.js:
    • All the pieces from the old summary can probably be removed, if not removed yet.
    • De-duplicate the functions summarizeMetrics and summarizeMetricsWithThresholds, or just replace the first with the second one (as we may no longer need the first one if we remove the old summary code).
  • Review the structure and code of the output/summary package:
  • Verify all scenarios (different test scripts) work as expected (test with no groups nor scenarios, test with only groups, test with only scenarios, test with the combination of both, all of them with compact mode enabled or not).
  • Make the CI look 🟢 as an 🍏 again!
  • (Optional) Define the JSDoc for the newly introduced functions in JS code.
  • Remove the playground/full-summary files, or define a proper location for them.
    --- Ideas left for a second iteration ---
  • Re-write the JS code in TS.
  • Sorting the thresholds by metric name, sorting the tags within the metric name and perhaps sorting the sources.
  • Implementing a more optimized data structure (as explored in https://github.com/joanlopez/xk6-custosummary/tree/main/timeseries) to use less memory allocations while keeping scenarios & groups samples.

General

  • I have performed a self-review of my code.
  • I have added tests for my changes.
  • I have run linter locally (make lint) and all checks pass.
  • I have run tests locally (make tests) and all tests pass.
  • I have commented on my code, particularly in hard-to-understand areas.

@joanlopez joanlopez requested a review from a team as a code owner December 4, 2024 17:58
@joanlopez joanlopez requested review from mstoykov and olegbespalov and removed request for a team December 4, 2024 17:58
@joanlopez joanlopez marked this pull request as draft December 4, 2024 17:58
@oleiade oleiade force-pushed the new-end-of-test-summary-output branch from 58138ac to 126a188 on December 17, 2024 15:58
@joanlopez joanlopez force-pushed the new-end-of-test-summary-output branch from 1904999 to 4bb8f7a on February 7, 2025 21:11
@joanlopez joanlopez force-pushed the new-end-of-test-summary-output branch from 4bb8f7a to 9c1ed70 on February 7, 2025 21:14
@joanlopez joanlopez marked this pull request as ready for review February 7, 2025 21:59
@olegbespalov olegbespalov added the breaking change for PRs that need to be mentioned in the breaking changes section of the release notes label Feb 13, 2025
Contributor

@olegbespalov olegbespalov left a comment


I did a pass on that, but I'll need a couple more for sure; commenting just to signal that I'm on it.

internal/js/runner.go
lib/models.go
@olegbespalov olegbespalov self-requested a review February 13, 2025 16:59
@joanlopez
Contributor Author

I did a pass on that, but I'll need a couple more for sure; commenting just to signal that I'm on it.

Sure, thanks! Take your time!

@olegbespalov
Contributor

full: aiming to bring clearer and more valuable results to users. Find a screenshot below.

Where can I find the screenshot for that? 🤔

I've tried running it in both modes and I'm not sure I see any difference.

[image]

output/summary/summary.go
internal/cmd/run.go
internal/cmd/testdata/summary/browser.js
internal/js/summary.go
internal/lib/testutils/minirunner/minirunner.go
}

// NewSummary instantiates a new empty Summary.
func NewSummary() *Summary {
Contributor


Out of curiosity, why have we decided to use an empty summary constructor? I mean, why don't we require the mandatory values through the constructor (and maybe even apply validation)?

Contributor Author


I don't think I have a concrete answer, to be honest. I think the reason I followed this approach is that it's mostly a DTO, and concretely a recursive one, so it felt easier to initialize it empty (like when you initialize a map) and populate it as you go, instead of asking for the inner data in the constructor.

Most of the logic is on the summary.Output side, but I preferred not to couple both, even if that's the main use, at least for now.

output/summary/data.go
@joanlopez
Contributor Author

full: aiming to bring clearer and more valuable results to users. Find a screenshot below.

Where can I find the screenshot for that? 🤔

I've tried running it in both modes and I'm not sure I see any difference.

[image]

The key difference between the full and compact modes is that the former also displays partial results for groups and scenarios, while the latter only displays totals. However, if there are no groups or scenarios, their appearance is the same.

Do you have any other suggestions for differentiating between them?
cc/ @oleiade do you have any other ideas? Perhaps hiding some data? 🤔

To be fully transparent, I don't have a strong opinion here, but I'd advocate either making a change we all fully agree on, or moving forward as-is, to avoid looping in cycles. As far as I know, we offer no guarantees on the text summary format, so it should be fine to iterate on it in the near future if needed. The one shipped as part of this PR doesn't need to be the definitive one before and throughout v1.
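For what it's worth, the full-vs-compact distinction described above can be sketched roughly as follows (the data shapes and group names are illustrative assumptions, not the actual summary model):

```javascript
// Hypothetical sketch: full mode reports per-group partial results plus
// the total; compact mode reports only the total.
function summarizeCounts(samplesByGroup, mode) {
  const total = Object.values(samplesByGroup).reduce((a, b) => a + b, 0);
  if (mode === 'compact') return { total };
  return { total, groups: { ...samplesByGroup } };
}
```

With no groups, both modes would collapse to the same { total } shape, which is why the two outputs look identical for group-less scripts.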

@joanlopez joanlopez added this to the v1.0.0-rc1 milestone Feb 18, 2025
@joanlopez joanlopez force-pushed the new-end-of-test-summary-output branch from d2511c3 to b8ed6e0 on February 18, 2025 15:44
@olegbespalov
Contributor

@joanlopez like I said internally, I'm totally fine with proceeding as-is, with one exception: it's worth adjusting the text when it lands in the documentation.

full: aiming to bring clearer and more valuable results to users. Find a screenshot below.

IMO this is too generic and vague, and in my case it was a source of confusion, since I expected to see those more valuable results from the beginning.

@joanlopez
Contributor Author

@joanlopez like I said internally, I'm totally fine with proceeding as-is, with one exception: it's worth adjusting the text when it lands in the documentation.

full: aiming to bring clearer and more valuable results to users. Find a screenshot below.

IMO this is too generic and vague, and in my case it was a source of confusion, since I expected to see those more valuable results from the beginning.

Nice, thanks for your input @olegbespalov!
Let's wait for @oleiade's input, but yeah, I'll definitely take that into consideration when writing the corresponding docs.

@oleiade
Member

oleiade commented Feb 19, 2025

Hey @joanlopez @olegbespalov 👋🏻

Apologies for the delay and for being blocking here. I overlooked the compact vs full list of metrics during the last phases of the design, but after a bit of digging into some of our initial design docs, I found that we had come up with a candidate list of metrics to exclude in compact mode (and include in full/extended mode):

We exclude the following metrics from default results:
http_req_blocked
http_req_connecting
http_req_receiving
http_req_sending
http_req_tls_handshaking
http_req_waiting

The rationale was that we wanted to show only what is relevant to the vast majority of users in compact mode, and only bring the rest back when users explicitly request full mode. In general, I remember we discussed focusing on removing, by default, things that would only be relevant to a very small portion of users or to very specific use cases.
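Applied literally, that candidate list would translate into a filter along these lines (a sketch only; the actual compact-mode logic in the PR may differ):

```javascript
// Metrics proposed for exclusion from the compact summary, per the design
// doc quoted above.
const COMPACT_EXCLUDED = new Set([
  'http_req_blocked',
  'http_req_connecting',
  'http_req_receiving',
  'http_req_sending',
  'http_req_tls_handshaking',
  'http_req_waiting',
]);

// Hypothetical helper: compact mode drops the excluded metrics; full mode
// keeps everything.
function visibleMetrics(metricNames, mode) {
  if (mode === 'full') return metricNames;
  return metricNames.filter((name) => !COMPACT_EXCLUDED.has(name));
}
```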

At the time we outlined only HTTP metrics, because that's where we thought most of those cases were, but I think we should feel free to expand the list if we consider other metrics "not absolutely mandatory".

I also agree with @olegbespalov that we are explicitly excluding the end-of-test summary from the v1.Y.Z support policy, and we should feel free to iterate on it in the future. So we don't have to block on this if there are diverging ideas about, for instance, what the list of included/excluded metrics should be: even if we kept the list as it is now, that would be 👍🏻 for me 🙇🏻

Hope that's helpful, and again, great work @joanlopez I love it ❤️ 🚀
